Estimating the Prediction Function and the Number of Unseen Species in Sampling with Replacement
نویسندگان
چکیده
A sample of N units is taken from a population consisting of an unknown number of species. We are interested in estimating the number of species and the prediction function for future sampling. The prediction function is defined as the expected number of new species that will be found if an additional sample of size tN is taken, for any positive real number t. In this paper we point out that an estimator suggested by Efron & Thisted (1976) lack some essential properties of the true prediction function, e.g., the property of alternating copositivity. As a result, it cannot be used for large values of t. We propose an alternative estimator which possesses the essential properties, and is easily obtained. We illustrate our estimator with two numerical examples and a simulation study.
منابع مشابه
Estimating Nitrogen and Acid Detergent Fiber Contents of Grass Species using Near Infrared Reflectance Spectroscopy (NIRS)
Chemical assessments of forage clearly determine the forage quality; however, traditional methods of analysis are somehow time consuming, costly, and technically demanding. Near Infrared Reflectance Spectroscopy (NIRS) has been reported as a method for evaluating chemical composition of agriculture products, food, and forage and has several advantages over chemical analyses such as conducting c...
متن کاملThe efficiency of sampling indices in estimating the spatial pattern of wooden species in central zagros forests (Kalkhani forest in Kouhdasht, Lorestan province, Iran)
It is so important to apply suitable methods to have a reliable estimation of the spatial distribution of trees. This research was aimed to determine and evaluate the spatial pattern of five species by distance- and density-based indices (Quercus brantii, Acer moncepesulanum, Crataegus aronia, Pistacia atlantica & Amygdalus lycioides) in the Kalkhani Forest in Koudasht Lorestan province, Iran. ...
متن کاملIntroduction to a paper by Efron and Thisted ‘ Estimating the number of unseen species : How many words did Shakespeare know ? ’
This paper is the first of two written by Brad Efron and Ron Thisted studying the frequency distribution of words in the Shakespearean canon. The key idea due to Fisher in the context of sampling of species is simple and elegant. When applied to Shakespeare the idea appears to be preposterous: an author has a personal vocabulary of word species represented by a distribution G, and text is gener...
متن کاملPrediction of potential habitat distribution of Artemisia sieberi Besser using data-driven methods in Poshtkouh rangelands of Yazd province
The present study aimed to model potential habitat distribution of A. sieberi, and its ecological requirements using generalized additive model (GAM) and classification and regression tree (CART) in in the Poshtkouh rangelands of Yazd province. For this purpose, pure habitats of the species was delineated and the species presence data was recorded by the systematic-randomize sampling method. Us...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کامل